System, method and computer program for improved forecasting residual values of a durable good over time

ABSTRACT

A residual value forecasting system may utilize heterogeneous data, such as used market data, industry-specific data, and non-industry-specific data, from disparate data sources to produce residual value forecasts of an item based on a sophisticated residual value forecasting model particularly configured for agility. The system can dynamically and quickly adapt to change in data inputs and produce custom outputs. The system may determine a baseline value for an item using the used market data, a microeconomic factor using the industry-specific data, and a macroeconomic factor using the non-industry-specific data, as well as adjustments such as locality adjustments and modifications. Given the macroeconomic factor and the microeconomic factor relative to the locally adjusted value of the item and in view of the competitive sets of similar and/or substitute items in the same industry, the system can generate an accurate forecast residual value of the item at a future time point.

CROSS-REFERENCE TO RELATED APPLICATIONS

This is a conversion of, and claims a benefit of priority from U.S. Provisional Application No. 62/406,786, filed Oct. 11, 2016, entitled “SYSTEM, METHOD AND COMPUTER PROGRAM FOR IMPROVED FORECASTING RESIDUAL VALUES OF A DURABLE GOOD OVER TIME,” which is fully incorporated by reference herein. This application is a continuation-in-part of U.S. patent application Ser. No. 15/423,026, filed Feb. 2, 2017, entitled “SYSTEM, METHOD AND COMPUTER PROGRAM FOR FORECASTING RESIDUAL VALUES OF A DURABLE GOOD OVER TIME,” which is a continuation of U.S. patent application Ser. No. 13/967,148, filed Aug. 14, 2013, now U.S. Pat. No. 9,607,310, entitled “SYSTEM, METHOD AND COMPUTER PROGRAM FOR FORECASTING RESIDUAL VALUES OF A DURABLE GOOD OVER TIME,” which is a conversion of, and claims a benefit of priority from U.S. Provisional Application No. 61/683,552, filed Aug. 15, 2012, entitled “SYSTEM, METHOD AND COMPUTER PROGRAM FOR FORECASTING RESIDUAL VALUES OF A DURABLE GOOD OVER TIME,” all of which are hereby incorporated by reference as if set forth herein in their entireties.

TECHNICAL FIELD

This disclosure relates generally to forecasting future market value of durable goods, and more particularly to improved systems, methods and computer program products for forecasting the value of an item using microeconomic, macroeconomic, and competitive set information and updating the forecast value at predetermined time intervals.

BACKGROUND OF THE RELATED ART

The market value of an item is known at the time that it is sold to a consumer. After this initial transaction, however, the value of the item will decline. The amount by which the value decreases may depend upon many factors, such as the amount of time that has passed since the original sale, the amount of wear experienced by the item, and so on.

Because of the difficulty of determining these factors with any certainty, the value of an item after its initial sale is conventionally determined by resale values of the item. For instance, the value of a two-year-old automobile is determined by examining the prices for which similarly equipped automobiles of the same make, model and year have actually sold. While some adjustments may be made to these values (e.g., for vehicle mileage above or below some average range), determination of the automobile's value generally relies on past resale prices of the same vehicle.

Since these conventional methods of determining the value of an item are relatively simplistic and take into account only backward-looking data (e.g., past sales of the item), they are not as accurate as may be desired. For instance, an automobile leasing company may need to know the future value of the automobiles that it owns in order to obtain financing for expansion or other business transactions. It would therefore be desirable to provide improved methods for determining the future value of such items.

SUMMARY OF THE DISCLOSURE

This disclosure is directed to new and improved systems, methods and computer program products for forecasting future values of an item that solve one or more of the problems discussed above. An object of the invention is to provide realistic and adjusted residual values of a durable good (item) over the item's lifecycle to reflect the market, incentives and purchases. Another object of the invention is to provide accurate, reliable residual values across items being valued such that manufacturers can market their items with clear, consistent messages based on accurate, reliable forecasts. Yet another object of the invention is to provide relevant and timely residual values that reflect product enhancements, packaging, and/or content adjustments made to items being valued. Yet another object of the invention is to provide residual values that have utility to each manufacturer's ecosystem. Such residual values may encompass all phases of a durable good sales cycle, for instance, from dealer engagement, manufacturer support, cooperation on pricing, to off-lease supply management.

These and other objects of the invention may be realized in a residual value forecasting system embodied on one or more server machines particularly configured for generating forecasted future values (residual values) of an item, for instance, a high-value durable good such as a vehicle. The system may utilize various types of data received and/or obtained from disparate data sources over a network to produce variations of residual value forecasts of the item based on a new and improved residual value forecasting model. Particularly configured for agility, the system can dynamically and quickly adapt to change in data inputs and produce new outputs (referred to herein as “deliverables”), such as a blended or customized forecast, to client devices. In addition to agility, the new and improved residual value forecasting model disclosed herein can also change how deliverables are produced by implementing a significantly more sophisticated residual value forecasting algorithm.

In some embodiments, a residual value forecasting method implementing a special residual value forecasting algorithm may include receiving, by a system implementing the method and operating in a network computing environment, a request from a client device for a residual value forecast of an item. For the purpose of illustration, and not of limitation, the item can be a vehicle or any high-value durable good that does not wear out quickly or that yields utility over time. Responsively, the system may determine a baseline value for the vehicle, based on a given configuration of the vehicle, and determine a reference period at which adjustments to the baseline value may be made. The reference period may begin at an initial time and ends a period of time from the initial time (“referred to as the forecast time”). The initial time may be the day of the request or a day in the past. The period of time may be a number of months such as 24-month, 36-month, etc.

The residual value forecasting method may further comprise determining locality adjustment(s) to the vehicle; collecting or estimating incremental values of modifications to the base configuration of the vehicle; determining locally adjusted value of the modified vehicle; constructing competitive sets of similar and/or substitute vehicles in the same industry, for instance, the used vehicle industry; collecting macroeconomic data and determining a macroeconomic factor based on the collected macroeconomic data; collecting microeconomic data and determining a microeconomic factor based on the collected microeconomic data; and generating a forecast residual value of the vehicle at the forecast time, given the macroeconomic factor and the microeconomic factor relative to the locally adjusted value of the modified vehicle and in view of the competitive sets of similar and/or substitute vehicles in the same industry.

In some embodiments, the residual value forecasting method may further comprise performing at least a quality assurance process. The quality assurance process may entail comparing the forecast residual value of the vehicle with residual values of vehicles in the competitive sets, computing adjustments accordingly, and generating a final residual value for the vehicle.

In some embodiments, the residual value forecasting method may leverage linear regression modeling techniques to provide purely data science driven outputs with high R-squared values, for instance, at least approximately 80% to 85%. Skilled artisans appreciate that linear regression calculates an equation that minimizes the distance between a fitted line and all of the data points. R-squared is a statistical measure of how close the data are to the fitted regression line. In the context of this disclosure, this statistical measure provides quantitative evidence in how the new and improved residual value forecasting model can alone explain a significantly higher percentage of the variance in the dependent variable, without user intervention, oversight processing, or any qualitative feedback cycle (referred to herein as “qualitative input”) to the model output. As skilled artisans can appreciate, high reliance on qualitative input can affect accuracy of values in a negative way based on processing inefficiencies.

The significant reduction of non-efficient qualitative input enables a system implementing the residual value forecasting method disclosed herein to perform significantly more efficiently and reduce processing times and resources such as computer systems used. The system may optionally allow efficient qualitative input, if desired. Efficient qualitative input may be much more dedicated to non-processing matters such as efficient quality assurance (QA) of the output. This focus, in turn, can result in producing more accurate and higher quality output.

One embodiment may comprise a system having a processor and a memory and configured to implement a method disclosed herein. One embodiment may comprise a computer program product that comprises a non-transitory computer-readable storage medium which stores computer instructions that are executable by at least one processor to perform the method. Numerous other embodiments are also possible.

These, and other, aspects of the disclosure will be better appreciated and understood when considered in conjunction with the following description and the accompanying drawings. It should be understood, however, that the following description, while indicating various embodiments of the disclosure and numerous specific details thereof, is given by way of illustration and not of limitation. Many substitutions, modifications, additions and/or rearrangements may be made within the scope of the disclosure without departing from the spirit thereof, and the disclosure includes all such substitutions, modifications, additions and/or rearrangements.

BRIEF DESCRIPTION OF THE DRAWINGS

Other objects and advantages of the invention may become apparent upon reading the following detailed description and upon reference to the accompanying drawings.

FIG. 1 depicts a diagrammatic representation of an example of system architecture, according to some embodiments disclosed herein.

FIG. 2 depicts a diagrammatic representation of an example of various types of data collected by an example of an enterprise computer system in which embodiments disclosed herein may be implemented.

FIG. 3A depicts a diagrammatic representation of an example of a network computing environment implementing a variety of processes particularly configured for processing various types of data from disparate data sources and providing outputs to client device(s), according to some embodiments disclosed herein.

FIG. 3B depicts a diagrammatic representation of an example of a network computing environment similar to the network computing environment shown in FIG. 3A, without certain specific types of data, according to some embodiments disclosed herein.

FIG. 3C depicts a plot diagram comparing two residual curves generated with (FIG. 3A) and without (FIG. 3B) certain specific types of data, according to some embodiments disclosed herein.

FIG. 4 is a process flow illustrating the acquisition of various types of data from disparate data sources and preparation of input data for a residual value forecasting method, according to some embodiments disclosed herein.

FIG. 5 is a process flow illustrating an example of a residual value forecasting method, according to some embodiments disclosed herein.

FIG. 6 is a flow diagram illustrating an example of a method for optionally revising a generated residual value curve based on qualitative input via a feedback cycle, according to some embodiments disclosed herein.

FIG. 7 is a flow diagram illustrating an example of a method for optionally allowing a client to provide qualitative input on a generated residual value curve, according to some embodiments disclosed herein.

FIG. 8 depicts a plot diagram illustrating an example of a residual value curve, according to some embodiments disclosed herein.

FIG. 9 depicts a plot diagram illustrating an example of a residual value curve adjusted based on competitive set comparison, according to some embodiments disclosed herein.

FIG. 10 depicts a bar diagram illustrating the effects of modification adjustments and locality adjustments, according to some embodiments disclosed herein.

FIG. 11 depicts a plot diagram illustrating percentage points adjustments by factor, according to some embodiments disclosed herein.

FIG. 12 depicts a diagrammatic representation of a user interface of a workbench application, according to some embodiments disclosed herein.

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS

Embodiments of the invention and various features and advantageous details thereof are explained more fully with reference to the non-limiting embodiments that are representatively illustrated in the accompanying drawings and detailed in the following description. Descriptions of well-known starting materials, processing techniques, components and equipment are omitted so as not to unnecessarily obscure the invention in detail. It should be understood, however, that the detailed description and specific examples, while indicating exemplary and representative embodiments of the invention, are given by way of illustration only and not by way of limitation. Various substitutions, modifications, additions or rearrangements are within the spirit or scope of this disclosure and will become apparent to those skilled in the art from this disclosure.

For the purposes of this disclosure, the term “item” may be used to refer to a durable good, product, or any item that has a known value at the time it was first sold and that has a different resale value over time thereafter. Examples of an item may include a vehicle, a real estate property, etc.

The resale value of an item may be affected by various factors such as time, the availability of same or similar items, the geographical location where the item physically resides, demand in for the item in the resale market and/or industry, the purchasing power of the target buyers, and so on. An ability to determine the amount by which the item will change (e.g., devalue) over time, and thereby forecast the resale or residual value of the item can provide a better understanding of a company's assets and can allow the company to make better decisions.

FIG. 1 depicts a diagrammatic representation of an example of system architecture, according to some embodiments disclosed herein. For purposes of clarity, a single client computer 110, a single server computer 140, and a single data source 160 are shown in the example of FIG. 1. Client and server computers 110, 140, and data source 160 each represents an exemplary hardware configuration of a data processing system capable of bi-directionally communicating with other networked systems and devices over a network such as the Internet. Those skilled in the art will appreciate that enterprise computing environment 130 may comprise multiple server computers, and multiple client computers and data sources may be bi-directionally coupled to enterprise computing environment 130 over network 120.

Client computer 110 can include central processing unit (“CPU”) 111, read-only memory (“ROM”) 113, random access memory (“RAM”) 115, hard drive (“HD”) or storage memory 117, and input/output device(s) (“I/O”) 119. I/O 119 can include a keyboard, monitor, printer, and/or electronic pointing device. Example of I/O 119 may include mouse, trackball, stylist, or the like. Client computer 110 can include a desktop computer, a laptop computer, a personal digital assistant, a cellular phone, or nearly any device capable of communicating over a network. Server computer 140 may have similar hardware components including CPU 141, ROM 143, RAM 145, HD 147, and I/O 149. Data source 160 may include a server computer having hardware components similar to those of client computer 110 and server computer 140, or it may be a network-enabled data storage device.

Each computer shown in FIG. 1 is an example of a data processing system. ROM 113 and 143, RAM 115 and 145, HD 117 and 147, and database 150 can include media that can be read by CPU 111 and/or 141. Therefore, these types of computer memories exemplify non-transitory computer-readable storage media. These memories may be internal and/or external to computers 110 and/or 140.

Portions of the methods described herein may be implemented in suitable software code that may reside within ROM 143, RAM 145, HD 147, database 150, or a combination thereof. In some embodiments, computer instructions implementing an embodiment disclosed herein may be stored on a direct access storage device (DASD) array, magnetic tape, floppy diskette, optical storage device, or any appropriate non-transitory computer-readable storage medium or storage device. A computer program product implementing an embodiment disclosed herein may therefore comprise one or more computer-readable storage media storing computer instructions translatable by CPU 141 to perform an embodiment of a method disclosed herein.

In an illustrative embodiment, the computer instructions may be lines of compiled C++, Java, or other language code. Other architectures may be used. For example, the functions of server computer 140 may be distributed and performed by multiple computers in enterprise computing environment 130. Accordingly, each of the computer-readable storage media storing computer instructions implementing an embodiment disclosed herein may reside on or accessible by one or more computers in enterprise computing environment 130.

FIG. 2 depicts a diagrammatic representation of an example of various types of data collected by an example of enterprise computer system 140 in which embodiments disclosed herein may be implemented. In this example, enterprise computer system 140, which may be embodied on one or more server machines operating in enterprise computing environment 130, may receive, obtain, or otherwise collect various types of data 161-166. Before describing these data types in detail, an overview of residual value forecasting methodology may be helpful.

The current market value of a durable good (“item”) is known at the time of sale, t₀, but its resale value at some future time points, t_(n)>t₀, may be largely unknown. In this disclosure, a forecast of such a resale value can be generated by computing a special function with estimated coefficients.

An ability to forecast the resale—or “residual”—value of item provides a better understanding of the amount by which the item will devalue over fixed interval of time. If the aim is to determine the amount of devaluation of an item that will occur between time period m and n (Δ_((n,m))=t_(n)−t_(m)), one must compute:

ΔV _(i,(n,m)) =V _(i,n) −V _(i,m)  [Equation 1]

where V_(i,m)=the value of item i at time t_(m)

-   -   V_(i,n)=the value of item i at time t_(n).

Though the change in valuation between any single time point and a future time point requires that Δ_((n,m))=(t_(n)−t_(m)) be greater than or equal to 0, there is no restriction on the algebraic sign of the change in value during that time period as an item may increase in value as time elapses. Briefly referring to FIG. 8, the market value of item i in the current period, t₀, is V_(i,0) but continually declines over time. After a period Δ_((n,m))=(t₁−t₀), the change in value of item i is Δ_((n,m))=V_(1,0)<0. The change in value between Δ_((n,m))=(t₂−t₀) is ΔV_(i(n,m))=V_(i)<0 and the change in value between Δ_((n,m))=(t₂−t_(i)) IS ΔV_(i(n,m))=V_(i,2)−V_(i,1)<0. Though the devaluation over time requires that Δ_((n,m))=(t_(n)−t_(m)) be greater than or equal to 0, there is no restriction on the algebraic sign of the change in value during that time period as an item may increase in value as time elapses.

A major complication that arises in determining the residual value of an item at a future time point, V_(i,n), will not actually be known until t_(n). This complication suggests that some type of forecasting must be conducted in order to estimate residual values in time periods that have not yet been reached. This disclosure provides a methodology for forecasting residual values in two time periods, t_(m) and t_(n), and enables the construction of a change in valuation metric ΔV_(i,(n,m)). By estimating the changes in value for successive future time intervals, one can then construct a function that captures the estimated relationship between time and the item's value. In this approach, a residual value forecasting model is built to predict V_(i,n) for any time period 0≦n≦T. As forecast interval is relative to the baseline, Δ_((n,0))=(t_(n)−t₀), the farther away in time a forecast is relative to the baseline, the more uncertainty will exist. Accordingly, the forecasting error ε_(n,0) will grow as the width of the time interval, Δ_((n,0)), increases.

Taking this uncertainty into consideration, embodiments utilize different types of data to aid in forecasting residual values of an item over time. Example data types include, but are not limited to, modifications to the items, locality of the items, microeconomic factors, macroeconomic factors, and sets of competitive items. Special variables representing these data types will be discussed in more detail below.

Modifications 161 reflect any changes to item i that may affect its value at any time point. Examples of modifications (M_(i)) include options added to the item in prior periods, different configurations/styles of the item, or other features which may distinguish one item from another that is produced by the same manufacturer.

Locality 162 represents valuation differences of item i in an industry (p), the valuation differences varying geographically (iεp). Examples of Locality (L_(p)) would include adjustments to equalize sales of the essentially identical items made in different locations, allowing valuation to be conducted, for instance, at both the national and state/province levels.

Depreciation 163 represents the natural change in value that occurs as item i is used over time. Depreciation (D_(i)) can be determined from past sales of the item. Some embodiments of a residual value forecasting method may not need to rely on D_(i) in generating a residual value forecast.

Microeconomic 164 represents information specific to the industry p to which item i is associated (iεp). For example, microeconomic factors (G_(p)) may include supply and/or demand specific to the industry p, industry trends, seasonality, and/or volatility of the item, or information about a company that is in the industry p. For example, segment supply and model supply are specific to the automotive industry and thus are considered microeconomic factors specific to the automotive industry. In contrast, the overall industry supply is considered a macroeconomic impact factor as it affects the overall automotive economy.

Macroeconomic 165 represents information that is non-specific to the item and/or its industry. Macroeconomic factors (F) may relate to the overall economy, rather than to the specific industry with which item i is associated (e.g., the real estate or automotive industries). Examples of macroeconomic information may include gas prices, inflation, unemployment rate, interest rates, industry-wide used market supply, etc. All vehicles (e.g., fleet vehicles, lease/loan financed vehicles, cash paid vehicles) are generally expected to return to the used market (e.g., used vehicles offered at dealers, used vehicles transacted from private parties to private parties, etc.) in a given period of time with certain ages (e.g., 1-5 year-old vehicles). As an example, when a new car is leased, at some point in time that vehicle is expected to be returned to a bank after the lease is up and the returned vehicle most likely will be offered in the used market for sale by a dealer. Such items or units in the overall industry-wide used market supply can be in the millions. For example, over 11 million units of vehicles ages 1-5 years can be expected to return every year to the used market.

Competitive sets 166 represent information that relates to items that compete with the item of interest. Competitive sets (C_(iU)) include all other items, j=1, . . . , J (i≠j), in the same industry p and in the competitive set U (i,jεU∀j) which are similar and/or are reasonable substitutes for item i being valued. Examples of competitive items j may include items produced by different manufacturers that share similarities (e.g., similar vehicle attributes such as miles per gallon, engine type, transmission type, sports package, weather package, technology package, etc.) with item i being valued. Competitive sets 166 may also include information relating to sales incentives applied to competitive items j. Competitive sets 166 may further include information relating to sales or recall information for competing items.

Leveraging these particular data types, the new and improved residual value forecasting model described below can be applied to any durable good—that is, items not immediately consumed and retaining some non-negative value over time. In some embodiments, model variable representing the particular data types described above may encompass the various components of the residual value forecasting model required to value item i in any industry p. Specifically, the microeconomic (G_(p)), Locality (L_(p)), and competitive sets (C_(iU)) components are specific to an industry p pertaining to item i that is being valued as long as all other members, j=1, . . . , J, of the competitive set U are in the same industry, p, as item i.

A system implementing the new and improved residual value forecasting model may operate to quickly adapt to different types of input data and, as such, can dynamically produce differentiating outputs (also referred to as “deliverables” or “information products”) useful for various purposes such as data analyses. This useful agility and flexibility of the new system is illustrated in FIGS. 3A-3C.

FIG. 3A depicts a diagrammatic representation of an example of a network computing environment 130 having residual value forecasting system 140 embodied on one or more server machines and implementing a variety of processes 400, 500, 600, and 700 particularly configured for processing various types of data received, obtained, or otherwise collected (simultaneously, periodically, continuously, or at different times/frequencies such as daily, weekly, monthly, quarterly, etc. over various communications channels and protocols such as File Transfer Protocol (FTP)) over network 120 from disparate data sources 361, 363, 365, 367, and 369 (which can, for instance, include a FTP server) and providing outputs 300 b to client device(s) 110, according to some embodiments disclosed herein. Processes 400, 500, 600, and 700 are described in detail below. In some embodiments, raw data from disparate data sources 361, 363, 365, 367, and 369 can be stored in database 150. In some embodiments, processed data from processes 400, 500, 600, and 700 can be stored in database 150.

As illustrated in FIG. 3A, examples of types of data that may be received, obtained, or otherwise collected from various data sources may include data specific to an industry relating to a particular item (referred to herein as “industry-specific data”) and data not specific to any industry (referred to herein as “non-industry-specific data”) such as inflation, unemployment rate, etc. Additionally, residual value forecasting system 140 may receive, obtain, or otherwise collect different types of auction data and certified data. For example, residual value forecasting system 140 may receive, obtain, or otherwise collect open auction data, closed auction data, and certified pre-own data. Skilled artisans appreciate that, although FIG. 3A shows a data source per data type, this need not be the case. A single data source may provide residual value forecasting system 140 with one or more of these data types and multiple data sources may provide residual value forecasting system 140 with the same type of data. Accordingly, FIG. 3A is meant to be illustrative and non-limiting.

Similarly, FIG. 3B depicts a diagrammatic representation of residual value forecasting system 140 that receive, obtain, or otherwise collect various types of data from data sources 361, 363, and 369. In this example, residual value forecasting system 140 may consider the various types of data from data sources 361-369 in determining a residual value forecast for an item i, but may not include closed auction data and/or certified pre-owned data in its computation.

In the past, residual values of an item were calculated with significantly less heterogeneous data types than those shown in FIGS. 3A and 3B. For example, a system programmed to compute residual values for a used vehicle may rely solely on the wholesale/auction prices of used vehicles. This limits the system to rigidly producing a single type of output—residual values for a used vehicle. Embodiments of a residual value forecasting system (e.g., residual value forecasting system 140) disclosed herein is particularly programmed to operably take (and/or receive) a variety of data from disparate upstream data sources (e.g., data sources 361-369 shown in FIG. 3A or FIG. 3B, explained above), process them accordingly (explained below), and utilize them in various computations to produce information products (e.g., a custom output tailored to a customer's request, see e.g., FIG. 3C) which can have different utilities in downstream applications/scenarios. These computational processes allow the system to be agile, flexible, and robust in creating useful information products, not just for the wholesale/auction used vehicle market, but also for other industries such as the retail vehicle market, vehicle data providers, vehicle lease management, fleet management, etc. The impact of the system can be significant. For instance, a single point of residual value can have a billion dollar impact in the automotive marketplace.

FIG. 3C depicts a plot diagram comparing two residual curves generated with (e.g., FIG. 3A) and without (e.g., FIG. 3B) certain specific types of data, according to some embodiments disclosed herein. As illustrated in FIG. 3C, when residual value forecasting system 140 includes closed auction data and certified pre-owned data in addition to open auction data and other factors in determining a residual value forecast (curve) for item i over a 60-month period, custom output 300 a consistently provides forecasted residual values higher than those indicated by output 300 b. In some cases, one or both outputs (information products) may be presented (e.g., via a user interface) on a client device, allowing a user to utilize the output(s) to view, analyze, and/or take appropriate action such as setting a required bank reserve, as exemplified below.

Skilled artisans appreciate that there can be many useful applications of embodiments disclosed herein. For example, residual values generated by exemplary residual value forecasting system 140 disclosed herein (e.g., outputs 300 a and 300 b illustrated in FIG. 3C), can be used to estimate the value of automobiles over time and therefore allow one to determine the resale value that could be expected at future time points. Examples of automobiles may include nearly all passenger and light trucks available to consumers in the United States and Canada. Furthermore, the generated residual values can provide guidelines for pricing fixed-term vehicle leases which captures the expected change in value that will result in the time interval between the leased vehicle's acquisition at time t₀ and its disposition at time t_(d). In some embodiments, not only the estimated residual value of item i can be provided at disposition (V_(i,d)), but forecasted values of item i can also be provided at equally-spaced fixed time points between t₀ and t_(d), thereby allowing construction of a residual curve that captures the relationship between vehicle value and time. Over time, and as new information becomes available, residual value forecasting system 140 may update the stored forecasts to reflect changing values of exogenous macroeconomic and industry-specific microeconomic variables and vehicle-specific, endogenous variables (e.g., depreciation, competitive sets, modifications, etc.).

Referring to FIGS. 4-7, examples of processes 400, 500, 600, and 700 are shown. Processes 400, 500, 600, and 700 may be implemented, for example, in residual value forecasting system 140 as shown in FIG. 1. It should be noted that the particular steps illustrated in FIGS. 4-7 are exemplary, and the steps of alternative embodiments may vary from those shown in FIGS. 4-7.

Referring to FIG. 4, process flow 400 illustrates the acquisition of various types of data from disparate data sources and preparation of input data for a residual value forecasting method (e.g., process 500 shown in FIG. 5, described below), according to some embodiments disclosed herein. In some embodiments, process 400 may be part of a residual value forecasting system embodied on one or more server computer (e.g., enterprise computer system 140) operating in an enterprise computing environment (e.g., enterprise computing environment 130) and specially programmed to implement a residual value forecasting method disclosed.

The residual value forecasting system may initially query data source(s) for information of various types described above (405). The data sources may include those (e.g., data storage units) that are internal to the enterprise computing environment and those that are external to the enterprise computing environment. In one embodiment, the residual value forecasting system may employ data crawlers that are particularly programmed to programmatically and automatically (e.g., periodically or continuously) query external data sources, including those operating in disparate network computing environments and conditions, searching for information relevant to generating a certain forecast, for instance, responsive to a request for a custom forecast from a client device communicatively connected to the residual value forecasting system over a network. The request from the client device may include information on a particular vehicle Year/Make/Model/Type and a specified time period. Optionally, the request may indicate a desired data type or data types to be used in generating the forecast. Alternatively or additionally, the residual value forecasting system may systematically and automatically generate various forecasts estimating the values of different vehicle Years/Makes/Models/Types over different time periods and lengths and may push the various forecasts thus generated to different client devices (which can be owned and operated by different entities/owners). Optionally, a registered user (e.g., a subscriber) who has an account with the residual value forecasting system may log in remotely to search and/or review a particular forecast or forecasts generated by the residual value forecasting system.

The residual value forecasting system may receive, obtain, or otherwise collect the data from these data sources (410) and store the collected data for further processing (415). The collected data is examined by the residual value forecasting system (420) and processed to identify portions of the data that will be used to generate the forecast.

The data may be “scrubbed” by the residual value forecasting system (425) in order to provide a better basis for the forecast. The scrubbing process may involve the residual value forecasting system performing various techniques to improve the quality of the data, such as identifying data that appears to be in error, removing outlying data points that substantially deviate from the remainder of the data, and so on. The data may also be filtered or examined by the residual value forecasting system to identify particular fields or types of data within the data that has been collected by the residual value forecasting system.

Still further, the residual value forecasting system may transform all or part of the collected data into forms (e.g., data representations having a normalized and/or common data structure internal to the residual value forecasting system) that are suitable for use/consumption by the residual value forecasting system. Such forms can include data structures for mapping incoming vehicle data to a vehicle code system (e.g., ALG vehicle code system), cleaning up data issues (e.g., manual entry errors that exist in the incoming vehicle data), adjusting transaction prices to certain assumptions such as normalized mileage per year, etc. After the desired data has been selected and scrubbed, if necessary, the modified data set can be stored (430) in a local data storage device, from which it can be retrieved and used by the residual value forecasting system in the generation of the forecast.

FIG. 5 is a process flow illustrating an example of residual value forecasting method 500, according to some embodiments disclosed herein. In this example, residual value forecasting method 500 may comprise determining a baseline value for an item with a base configuration (501); determining a reference period at which adjustments are to be made to the item (503); determining a constant width of time intervals at which forecasts are to be generated for the item (505); determining locality adjustment(s) to the item (507); collecting or estimating incremental values of modifications to the base configuration of the item (509); determining a locally adjusted value of the modified item at the forecast time (511, see, e.g., FIG. 10); constructing competitive sets of similar and/or substitute items in the same industry (513); collecting macroeconomic data and determining a macroeconomic factor (515); collecting industry-specific microeconomic data and determining a microeconomic factor (517); and generating a residual value of item at the forecast time (519, see, e.g., FIG. 11). Optionally, residual value forecasting method 500 may further comprising performing one or more quality assurance (QA) operations on the generated output (521, see, e.g., FIG. 12) and adjusting, if necessary, to generate a final forecast of residual value of the item (523, see, e.g., FIG. 3C). Note that construction of the residual value forecasts requires performing some steps at certain milestones in the lifetime of the item (e.g., at t₀ and at any time period when any modification is made), while others may be performed at each time period for which the item's value is to be forecasted. The steps are further described in detail below.

In some embodiments, a system implementing residual value forecasting method 500 may determine a baseline value for item i with a base configuration (501), for instance, by taking an h-month historical average (V_(i,h)) of data points of particular data types (e.g., used market values, wholesale/auction values, etc.) collected by the system. This operation may be triggered by a request from a client device communicatively connected to the system over a network, by an instruction or command from an administrator of the system (e.g., via an administrative tool of the system), or automatically by a programmed trigger or scheduled event.

Under most circumstance, recent historical market values are available for computing V_(i,h) (which represents the h-month historical average baseline value for item i with a base configuration). V_(i,h) may be expressed below as a function of time, t_(n) (n=0, . . . , T), taking a h-month historic average of the market values of item i at time t₀ before modifications.

V _(i,h)=(V _(i,0) +M _(i,n))×(τ_(i,n) ×L _(p,n))+(β₁ ΔF _(n|n-h)+β₂ ΔG _(p,n|n-h))+C _(iU,n|n)*  [Equation 2.1]

V _(i,h)=BV_(i,0)+(β₁ ΔF _(n|n-h)+β₂ ΔG _(p,n|n-h))+C _(iU,n|n*)  [Equation 2.2]

where h=1, . . . , H. As a non-limiting example, H may represent a value of 24.

Equation 2.2 represents another way to express V_(i,h) where (V_(i,0)+M_(i,n))×(τ_(i,n)×L_(p,n))=BV_(i,0). These special model variables are described in more detail below.

V_(i,0) represents an initial value at the beginning of the estimation period, t₀. V_(i,0) may be obtained through direct observation of the recent market values. Once V_(i,0) is known, it can be used as a baseline against which future values are computed.

V_(i,n) reflects the level of the model variable for item i at period t_(n).

M_(i,n) represents incremental values of modifications to the base configuration of item i of interest.

τ (tau) represents the locality adjustment coefficient where

$\begin{matrix} {\tau_{i,n} = \left\{ \begin{matrix} {{1\mspace{14mu} {if}\mspace{14mu} t_{n}} = 0} \\ {0\mspace{14mu} {otherwise}} \end{matrix} \right.} & \left\lbrack {{Equation}\mspace{14mu} 3} \right\rbrack \end{matrix}$

For example, τ=1 if t=0 (meaning for used values being observed currently) where BV_(i,0)=(V_(i,0)+M_(i,n))×(τ_(i,n)×L_(p,n)) represents recent market values vary by region (τ=1). In embodiments that do not forecast regional values, no locality adjustment is made (τ=0) for future values (forecast) if t>0.

L_(p,n) reflects the locality adjustment L_(p) made at time t_(n) to all items in industry p (iεp).

ΔF_(−,n|n−h) reflects the change in the macroeconomic (neither industry-specific nor item-specific) variable t_(n)−t₀, given the historical information about that variable in the last h=1, . . . , H periods (t_(n-1), t_(n-2), . . . , t_(n-H)).

ΔG_(p,n|n-h) reflects the change in the microeconomic variable t_(n)−t₀, given the historical information for industry p (iεp) available about that variable in the last h=1, . . . , H periods (t_(n-1), t_(n-2), . . . , t_(n-H)).

β₁ reflects the set of the coefficient(s) of the macroeconomic (neither industry-specific nor item-specific) variable, given the historical information about that variable in the last h=1, . . . , H periods (t_(n-1), t_(n-2), . . . , t_(n-H)),

β₂ reflects the set of the coefficient(s) of the microeconomic variable, given the historical information for industry p (iεp) available about that variable in the last h=1, . . . , H periods (t_(n-1), t_(n-2), . . . , t_(n-H)),

C_(iU,n|n*) reflects a competitive set adjustment made to item i based at time period t_(n*) based on an observed discrepancy between V_(i,n) and the predicted values of all other items, j=1, . . . , J (i # j) in the competitive set U (i,jεU∀j) evaluated at some reference period, t_(n).

BV_(i,0) represents the baseline value of item i at t=0, adjusted for modifications M_(i,n) and locality L_(p,n).

The output (V_(i,h)) of Equation 2.1 or 2.2 represents an h-month historical average current market value expressed in t_(0-n) months historical average, reflecting the market information across all localities, Z, in which item i is available.

If a baseline value cannot be determined or obtained directly for item i, the system may construct K competitive sets, U_(k), of similar and/or substitute items in the same industry and select the most similar item j (i≠j) as a substitute (see Equation 6) and use its value.

If the substitute item j's value was constructed in a time period before t₀, the system may escalate the value based on inflation values for industry p in which items i and j are assigned.

As discussed above, the farther away in time a forecast is relative to the baseline value, the more uncertainty will exist and the more forecasting error ε_(n,0) may exist. This seemingly unavoidable nature of forecasting future residual values can be highly undesirable, if not detrimental, to certain entities that rely on knowledge of the future residual values to make important decisions, sometimes with severe consequences, if the forecasted residual values are less than accurate. For example, knowledge of the future residual values may be useful to some entities some entities in setting leasing rates which reflect the expected change in valuation between the beginning and ends of a fixed lease period. As another example, knowledge of the future residual values may be useful to some entities in determining the amount at which an item can be resold at any time period—a useful metric that can be used in investment decisions such as real estate. As yet another example, knowledge of the future residual values may be useful to some entities in providing information supporting the strategic planning decisions made of the manufacturer of item i.

Furthermore, knowledge of the future residual values may be useful in understanding and determining whether the change in value will be constant over time intervals of the same length. For example, returning briefly to FIG. 8, the change (ΔV_(i,(1,0))) between V_(i) at t₀ (represented by V0 in FIG. 8) and V_(i) at t₁ (represented by V1 in FIG. 8) is larger than the change (→ΔV_(i,(2,1))) between V_(i) at t₂ (represented by V2 in FIG. 8) and V_(i) at t₁ (represented by V1 in FIG. 8). Constant changes in valuation over all periods of equal length, Δ_((n,m))=Δ_((m,p)) (m≠p) would result in a function between time and value represented by a straight line (increasing, decreasing or flat) while non-constant changes would be represented by a non-linear function.

To understand the relationship between residual values and time, embodiments employ both historical and current data. For example, if there is an underlying monthly seasonality in the residual values over time, it would take a few years of historical data to be able to detect, measure, or estimate the amount of seasonal variation. Additionally, it would be difficult to forecast residual values for an interval Δ_((n,m)) if the historical data used to construct the forecasting model has a length Δ<Δ_((n,m)). An additional data constraint results from the frequency at which the data used to construct the model (e.g., macroeconomic, microeconomic, competitive sets, etc.) is updated. If each of r=1, . . . , R input variables (not to be confused with the Q and R notations explained below) has an update frequency of φ_(r), then the frequency at which the residual forecasts can be updates is φ*=min_(r)(φ_(r)).

The knowledge of whether a residual value curve is linear or non-linear may be deterministic as to how the effect of potential time degradation is handled. For example, although the first observation (the time period when item i first becomes available on the market) is indexed at t₀, in some cases, a user of the residual forecast relationship may be interested in using a later time period, t_(s)≧t₀, as a starting point from which changes in valuation are assessed—for instance, if item i will not be purchased until t_(s) and will remain in the seller's inventory until then. The anchor point for the curve remains fixed at t₀, but the evaluation of the curve shift from Δ_((n,0)) to Δ_((n+s,s)) (on the horizontal axis) and from ΔV_(i,h) to ΔV_(i,(n+s,s)) (on the vertical axis). If the residual value curve was linear, the shifting of the time evaluation window by s periods would have no impact on the value change. However, in the cases where the residual value curve is non-linear, the appropriate time starting point should be chosen to account for the time degradation effect that occurs as item i remains in its original state.

Although the baseline value (V_(i,h)) of item i is known and remains unchanged, the forecast of residual value of item i needn't also remain fixed over time. As new information becomes available that is reflected in the variable types discussed above (e.g., variables in Equation 2.1 or 2.2 representing data types 161-166), it is possible to employ that additional information to update the forecasted residual value of item i.

Accordingly, in some embodiments, the system may operate to determine, at time t_(n)=0, a reference period, t_(n)*, at which adjustments are to be made to item i to align the baseline value of item i with values of other items in a competitive set of similar and substitute items in the same industry p as item i (503).

The reference period, t_(n)*, may be determined in consideration of the following constraints:

-   -   The minimum frequency in which the input data is updated. If         each of r=1, . . . , R input variables has an update frequency         of φ_(r), then the frequency at which the residual forecasts can         be updated is φ*=min_(r)(φ_(r)). The value of t_(n)* must be         aligned with this frequency. For example, if the minimum         frequency at which input data is updated on a monthly basis, the         reference value, t_(max) must correspond to month-level temporal         offsets beyond t₀.     -   The expected total lifetime, t_(max), of item i. If item i is         not expected to retain value after period t_(max), then         t_(n)*≦t_(max).     -   The utility of the outputs (residual value forecasts) from the         residual value calculations. For example, if the forecasted         residual values are to be used for annual corporate strategic         planning, t_(n)* should also be based on an annual offset to t₀         (or as close as possible given the two previous, more binding         constraints).

As an example, suppose the initial time point is Jul. 9, 2012 and input data to the model is updated on a monthly basis, the reference period could then be Jul. 9, 2012 to Aug. 9, 2012, Jul. 9, 2012 to Sep. 9, 2012, Jul. 9, 2012 to Oct. 9, 2012, etc. The reference period can be further constrained by the total expected lifetime of the item. For example, if the item is not expected to retain value after five years, then the reference period can be Jul. 9, 2012 to Jul. 9, 2017, or less (in one or more monthly temporal offsets as constrained by the update frequency of the input data to the model).

Once the reference period is determined, a number of forecasts desired between the initial time point and the reference period can be determined (505). The number of forecasts determines how often a forecast of the residual value of the item is to be generated. Starting from the initial time point, the time interval at which a forecast is to be generated can be the same as, or more than, the update frequency of the input data to the model. In some embodiments, the system may determine a constant width of time intervals, Δ_((p.q)), at which forecasts are to be generated for item i. The selection of Δ_((p.q)) can be determined by considering the following constraints:

-   -   It must be chosen such that (t_(n)*−t₀)/Δ_((p.q)) is a positive         integer.     -   It must be greater than or equal to φ*=min_(r)(φ_(r)).

Following the above example in which the expected lifespan of the item is five years, if it is assumed that the reference period is two years, there can be, for example, 23 forecasts, each of which is generated at a fixed time interval of one month. If the time interval is selected to be six months, then four forecasts are generated.

With the time interval determined, the system may determine a locality adjustment (L_(p)) to item i (507). If the value of item i does not vary by geographic region (the value of item i is the same in industry p across all localities at the initial time period), then no locality adjustment needs to be made. If the base value of items in industry p to which item i is assigned varies by geographic region, the baseline value of item i at the initial time point t₀ may be adjusted by computing a ratio between the average cost of items in the industry in a particular locality at a certain time point t_(n) and the local cost of items in the industry across all localities at the same time point t_(n).

In some embodiments, the system may determine locality adjustment, L_(p,n), to item i as follows:

$\begin{matrix} {L_{p,n} = \frac{L_{p,n}^{\prime}(z)}{L_{p,n}^{\prime}(Z)}} & \left\lbrack {{Equation}\mspace{14mu} 4} \right\rbrack \end{matrix}$

where L_(p,n)′(z) represents the average cost of items in industry p in locality z at time t_(n), and L_(p,n)′(Z) represents the local cost of items in industry p across all localities (zεZ) at time t_(n).

As an example, consumer price index can be utilized to determine the cost information on items in various industries relative to localities. As a specific example, this ratio may be determined for items available in the United States by referring to the Consumer Price Index (CPI) provided on a monthly basis by the U.S. Bureau of Labor Statistics. As another example, Statistics Canada produces similar series for that country. Skilled artisans appreciate that many economically-developed countries have consumer price index figures that can be used to generate the locality adjustments. Note that computation of L_(p,n) is normally performed at t₀ and when a value modification is made to account for modifications made to item i with the base configuration.

The system may collect and/or estimate incremental values of modifications, M_(i,n), to the base configuration of item i (509). There can be many types of modifications. One example type can be modifications that are both observable at a particular time t_(n) and are expected to retain some value in future time period(s) after the particular time t_(n). Another example type can be modifications that are not observable and/or not expected to retain value after the particular time t_(n). Equation 5 below illustrates two types of modifications:

-   -   Type A         m_(i,n) ^((α))         : represents modifications made to the base configuration of         item i at time t_(n) that are observable (tangible and         measurable) and are expected to retain some value in future time         periods after time period t_(n),     -   Type B         m_(i,n) ^((b))         : represents modifications made to the base configuration of         item i at time t_(n) that are not observable and/or not expected         to retain value after a modification is made at time t_(n).

$\begin{matrix} {M_{i,n} = \left\{ \begin{matrix} {M_{i,{n - 1}} + {L_{p,{n{(z)}}} \times \left( {m_{i,n}^{(a)} + m_{i,n}^{(b)}} \right)}} & {{{if}\mspace{14mu} t_{n}} \neq {t_{0}\mspace{14mu} {and}\mspace{14mu} {the}\mspace{14mu} {modification}\mspace{14mu} {was}\mspace{14mu} {made}\mspace{14mu} {in}\mspace{14mu} t_{n}} > 0} \\ \left( {m_{i,n}^{(a)} + m_{i,n}^{(b)}} \right) & {{{if}\mspace{14mu} t_{n}} = {t_{0}\mspace{14mu} {and}\mspace{14mu} {the}\mspace{14mu} {modification}\mspace{14mu} {was}\mspace{14mu} {made}\mspace{14mu} {in}\mspace{14mu} t_{0}}} \\ {M_{i,{n - 1}} - \left( {L_{p,{n{(z)}}} + m_{i,n}^{(b)}} \right)} & {{if}\mspace{14mu} t_{n}\mspace{14mu} {period}\mspace{14mu} {following}\mspace{14mu} {the}\mspace{14mu} {last}\mspace{14mu} {Type}\mspace{14mu} B\mspace{14mu} {last}\mspace{14mu} {modification}} \end{matrix} \right.} & \left\lbrack {{Equation}\mspace{14mu} 5} \right\rbrack \end{matrix}$

By adjusting the base configuration's value to account for modifications, M_(i,n), and locality adjustments, L_(p,n), the system may determine BV_(i,n) (see Equation 6)—a locally adjusted value of item i as modified (“modified item i”) at the forecast time t_(n) (511). An example of this process is illustrated in FIG. 10, which shows the effects of modification adjustments and locality adjustments over time.

BV_(i,n)=(V _(i,h) +M _(i,n))×(τ_(i,n) ×L _(p,n))  [Equation 6]

In some embodiments, the system may construct competitive sets of similar and/or substitute items in the same industry to which item i belongs (513). This construction may involve partitioning all items in the industry into k distinct clusters based on a measure of similarity between all pairs of items in the industry. A full explanation of an example competitive set approach is provided in U.S. Pat. No. 8,661,403, issued Feb. 25, 2014, entitled “SYSTEM, METHOD AND COMPUTER PROGRAM PRODUCT FOR PREDICTING ITEM PREFERENCE USING REVENUE-WEIGHTED COLLABORATIVE FILTER,” which is fully incorporated herein by reference. Other competitive set approaches may also be possible.

As an example, a durable good, x_(i), can be described by its features (1, . . . , m) (also known as characteristics or variables) as follows:

-   -   x_(i)={x_(i,1), x_(i,2), . . . , x_(i,m)}         and all N distinct goods may be represented in matrix form as

$X = {\begin{bmatrix} x_{1,1} & x_{1,2} & \ldots & x_{1,{m - 1}} & x_{1,m} \\ x_{2,1} & x_{2,2} & \ldots & x_{2,{m - 1}} & x_{2,m} \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ x_{{N - 1},1} & x_{{N - 1},2} & \ldots & x_{{N - 1},{m - 1}} & x_{{N - 1},m} \\ x_{N,1} & x_{N,2} & \ldots & x_{N,{m - 1}} & x_{N,m} \end{bmatrix}.}$

The similarity, s_(ij), between item i and item j based on a comparison of Q observable features, can be computed using the Minkowski metric:

$\begin{matrix} {s_{ij} = {1 - \left\lbrack {\sum\limits_{q = 1}^{Q}{w_{q}{{x_{i,q} - x_{j,q}}}^{\lambda}}} \right\rbrack^{1/\lambda}}} & \left\lbrack {{Equation}\mspace{14mu} 7} \right\rbrack \end{matrix}$

where λ≧0, 0≧s_(ij)≦1, and Σ_(q=1) ^(Q)w_(q)=1. Note that although the format of the data for some options (e.g., original equipment manufacturer options, dealer-installed vehicle options, etc.) may not be numeric, similarity can still be established across features by first transforming the data to a numeric scale. Programming techniques necessary to perform such a data transformation (e.g., text mining to transform text strings to numerical fields) are known to those skilled in the art and thus are not further described herein.

At time period t_(n), the system may compute the similarity for every pair of the N observations in the data set X, X_(i)≠X_(j), and then a N×N matrix of similarities, S_(n). There isn't a need to compute the values of s since the similarity between an observation and itself is, by definition, 1. With a subtraction from an N×N identity matrix, the dissimilarities can be computed (S _(n)=1−S_(n)) and used to build clusters, at time t_(n), comprising K distinct competitive sets, U_(k,n) (k=1, . . . , K). Using S _(n), the system can employ any one of a variety of hierarchical clustering methods to partition the observations into distinct competitive sets. Examples of hierarchical clustering methods can be found in A. D. Gordon, CLASSIFICATION, 1999. When the number of observations, N, is large, the system may employ the K-means clustering method after reprojecting S _(n) into an Q-dimensional set of points on a scale that preserves the dissimilarities that are invariant to translation and rotation. The mechanics of the K-means clustering method below can be found in Hartigan, J. A. and Wong, M. A., “A K-means Clustering Algorithm,” Applied Statistics 28, 1979, pp. 100-108.

1) Decide on a value for K.

2) Define K cluster centers (randomly, if necessary).

3) Decide the class memberships of the N objects by assigning them to the nearest cluster center.

4) Re-estimate the K cluster centers, by assuming the memberships found above are correct.

5) If none of the N objects changed membership in the last iteration, exit. Otherwise, go to 3).

As a specific example, if the K-means clustering method is employed, the system may partition the data into K clusters by maximizing the within-cluster variation. If a cluster is indexed by k containing n_(k) observations, the overall within cluster variance based on a clustering outcomes is:

$\begin{matrix} {\sigma_{w}^{2} = {\sum\limits_{k = 1}^{K}{\sum\limits_{q = 1}^{Q}{\sum\limits_{i = 1}^{n_{q}}\left( {x_{{(k)}_{i,q}} - {\overset{\_}{x}}_{{(k)}_{\bullet,q}}} \right)^{2}}}}} & \left\lbrack {{Equation}\mspace{14mu} 8} \right\rbrack \end{matrix}$

And the overall variance of the clustering outcome is the sum of the within-cluster and between-cluster variances:

σ² = σ_(w)² + σ_(b)².

Skilled artisans appreciate that a number of statistics may be utilized to decide how many clusters are to use. As a specific example, the Calinski-Harabasz index may be used:

$\begin{matrix} \frac{\sigma_{b}^{2}/\left( {K - 1} \right)}{\sigma_{w}^{2}/\left( {N - K} \right)} & \left\lbrack {{Equation}\mspace{14mu} 9} \right\rbrack \end{matrix}$

At every time period, t_(n), since the variables used to compute similarity may be time-dependent, the competitive set can be recomputed. At the end of this process, every item i, . . . , I will belong to one-and-only-one of the K competitive sets, U_(k,n).

To account for macroeconomic factor(s), the system may collect non-industry-specific macroeconomic data, F_(−,n|n-h), and either forecast future levels or incorporate existing forecasts from other sources to determine a macroeconomic factor {circumflex over (F)}_(.,n|n-h) (515). Here, “F.” implies that the macroeconomic factors are taken over all industries and not specific to any particular industry p. “{circumflex over (F)}.” indicates that it is an estimated value.

The single-dimensional macroeconomic factor, {circumflex over (F)}_(.,n|n-h) can be represented by a linear combination of Q variables, f_(.,n|n-h),q)(q=1, . . . , Q), where Q represents the number of macroeconomic features under consideration, for example, housing prices, gas prices, unemployment, the Dow Jones Industrial Average, etc., and q represents one single macroeconomic feature. If the current time period is t_(m), the information regarding future periods t_(n)>t_(m), will need to be forecasted.

Additionally, the data source may be internally-derived by the organization generating the residual value forecasts (and the value of the q^(th) variable at time t_(m) is denoted {circumflex over (f)}_(.,(m|m-h),q)). An example of an internally-derived data source is the ALG economic index shown in FIGS. 11 and 12. In this disclosure, the “ALG economic index” refers to a proprietary statistical measure of changes in a representative group of individual data points derived by ALG, Inc. of Santa Monica, Calif. The ALG economic index tracks current economic health and can be driven, for example, by three components—overall retail spending in the economy, employment ratio (e.g., how many people out of a working population are employed), and per capita gross domestic product (GDP). Alternatively, it may be from an external source such as an organization that provides economic analysis/forecasting (and the value of the q^(th) variable at time t_(m) is denoted by {circumflex over (f)}′_(.,(m|m-h),q)).

When gathering the data from multiple sources, it becomes necessary to combine them into a single value, f_(j,m,q). One method for combining these values is to create a single value which gives more weight to the data source or data type in which higher confidence is held. For example, a competitive set is a collection of vehicle data (e.g., Model Year, Make, Model, Trim, etc.) which are in a particular segment (e.g., midsize sedans such as Honda Accord and Toyota Camry). A more complete data set with, for instance, good pricing information will provide higher confidence as the competitive set can more easily be determined based on, for example, reliable pricing data. For the purpose of explanation, the system may use item j and time t_(m), where item j is used to represent a suitable substitute for item i in the same competitive set. Or, i and j may be the same item if a sufficient amount of historical data is available. The subscript m is for the time period as it may be possible that historical information required to estimate model parameters is only available from period t_(m-h) to t_(m) where t_(n)≧t_(m).

Accordingly, the combining equation at time t_(m) for item j can be expressed as follows:

$\begin{matrix} {f_{.{,m,q}} = {{{\phi_{m}{\hat{f}}_{.{,{({m|{m - h}})},q}}} + {\left( {1 + \phi_{n}} \right){\hat{f}}_{.{,{({m|{m - h}})},q}}^{\prime}\mspace{14mu} 0}} \leq \phi_{m} \leq 1}} & \left\lbrack {{Equation}\mspace{14mu} 10} \right\rbrack \\ {\mspace{79mu} {where}} & \; \\ {\mspace{79mu} {\phi_{m} = {\frac{\tau_{m}^{2}}{\tau_{m}^{2} + \sigma_{m}^{2}}.}}} & \left\lbrack {{Equation}\mspace{14mu} 11} \right\rbrack \end{matrix}$

Here, τ_(m) ² represents the squared estimation error for the externally-derived variable, {circumflex over (f)}′_(.,(m|m-h),q)), and σ_(m) ² represents the squared estimation error for the externally-derived variable, {circumflex over (f)}′_(.,(m|m-h),q)). After all of the variables have been collected reflecting their values from time period t_(m-h) to t_(n), the relationship between these variables and the modification/locality-adjusted base value, BV_(j,m), can be expressed as a linear combination of input variables:

BV_(j,m)(f)=a _(o)+Σ_(q=1) ^(Q)α_(q) f _(.,m,q)+ε_(j,m).  [Equation 12]

To determine the values of the Q+1 coefficients, α₀, . . . , α_(Q), the system may use the statistical method of Ordinary Least Square (OLS) regression as shown in Equation 13 below:

_(j,m)(f)={circumflex over (α)}_(o)+Σ_(q=1) ^(Q){circumflex over (α)}_(q) f _(.,m,q)  [Equation 13]

where the estimated values, {circumflex over (α)}_(o), . . . {circumflex over (α)}_(Q), are chosen such that the sum of squared errors for good j (SSEj) as shown in Equation 14 below is minimized.

SSE_(j)(f)=Σ_(m=0) ^(h≧n)(BV_(j,m)(f)−

_(j,m)(f))².  [Equation 14]

The estimation of the linear coefficients, {circumflex over (α)}_(o), . . . , {circumflex over (α)}_(Q), need not be computed at every period, rather the coefficients can be updated periodically, say at time t_(p), and then used to forecast the value of

_(j,m)(f)={circumflex over (α)}, . . . {circumflex over (α)}_(Q),for any time period t_(m)≧t_(p).

As the final step, once the observed or forecasted values of f_(i,n,q) are determined, the macroeconomic factor for item i, {circumflex over (F)}_(i,n|n-h), can be estimated as shown in Equation 15 below.

$\begin{matrix} {{\hat{F}}_{i,{n|{n - h}}} = {\left( \frac{{\hat{\alpha}}_{0} + {\sum\limits_{q = 1}^{Q}{{\hat{\alpha}}_{q}f_{.{,n,q}}}}}{{BV}_{i,n}} \right).}} & \left\lbrack {{Equation}\mspace{14mu} 15} \right\rbrack \end{matrix}$

In some embodiments, the system may also collect industry-specific microeconomic data, G_(p,n|n-h), for industry p in which item i being evaluated is classified and determine a microeconomic factor, g_(p,n|n-h) (517). The single-dimensional microeconomic factor for item i, Ĝ_(i,n|n-h), can be represent by a linear combination of Q variables, g_(p,(n|n-h),q)(q=1, . . . , Q) for industry trends, industry-specific inventories/supply, industry-specific demand, etc. If the current time period is t_(m), the information regarding future periods t_(n)>t_(m), will need to be forecasted.

Additionally, the data source may be internally-derived by the organization generating the residual value forecasts (and the value of the q^(th) variable at time t_(m) is denoted ĝ_(p,(m|m-h),q)) Alternatively, it may be from an external source such as an organization that provides economic analysis/forecasting (and the value of the q^(th) variable at time t_(m) is denoted by ĝ′_(p,(m|m-h),q)). For microeconomic features and macroeconomic features, q=1, . . . , Q variables have been denoted where F is the combination of variables (q=1, . . . Q) for macroeconomic features, f is a single dimensional macroeconomic variable, G is the combination of all microeconomic variables (q=1, . . . Q), and g is a final single dimensional microeconomic variable.

When gathering the data from multiple sources, it becomes necessary to combine them into a single value, f_(p,m,q). As explained above, q=1, . . . Q is being used for both microeconomic and macroeconomic variables. That is, q refers to the amount of variables, and Q and R the amount of betas. r is used here to describe the various external forecasting sources of one factor q (e.g., segment supply) in g (i.e., r refers to weighting one factor within g (factor g=1, . . . Q) by various sources r1, r2, etc.) that can be combined to come up with q such that BV_(j,m)(g)=β_(o)+Σ_(r=q) ^(Q)β_(q) g_(p,m,q)+ε_(j,m) (see Equation 18). In this way, one variable can reflect the data from multiple sources. As described above, one method for combining these values is to create a single value which gives more weight to the data source in which higher confidence is held. Following the above example notation (Equation 15), the combining equation at time t_(m) for item j can be:

$\begin{matrix} {{\hat{g}}_{p,m,q} = {{{\gamma_{m}{\hat{g}}_{p,{({m|{m - h}})},q}} + {\left( {1 + \gamma_{m}} \right){\hat{g}}_{p,{({m|{m - h}})},q}^{\prime}\mspace{14mu} 0}} \leq \gamma_{m} \leq 1}} & \left\lbrack {{Equation}\mspace{14mu} 16} \right\rbrack \\ {\mspace{79mu} {where}} & \; \\ {\mspace{79mu} {\gamma_{m} = \frac{\tau_{m}^{2}}{\tau_{m}^{2} + \sigma_{m}^{2}}}} & \left\lbrack {{Equation}\mspace{14mu} 17} \right\rbrack \end{matrix}$

Here, τ_(m) ² represents the squared estimation error for the externally-derived variable, ĝ′_(p,(m|m-h),q)), and σ_(m) ² represents the squared estimation error for the externally-derived variable, ĝ_(p,(m|m-h),q)). “Externally” in this case means that they (e.g., external sources/variables) are outside of the equation and are not determined by the equation. After all of the variables have been collected reflecting their values from time period t_(m-h) to t_(n), the relationship between these variables and the modification/locality-adjusted base value, BV_(j,m), can be expressed as a linear combination of input variables as follows:

BV_(j,m)(g)=β_(o)+Σ_(r=q) ^(Q)β_(q) g _(p,m,q)+ε_(j,m)  [Equation 18]

To determine the values of the Q+1 coefficients, β₀, . . . , β_(Q), the system may use the statistical method of OLS regression as shown in Equation 19:

_(j,m)(g)={circumflex over (α)}_(o)+Σ_(q=1) ^(RQ){circumflex over (β)}_(q) g _(p,m,q)  [Equation 19]

where the estimated values, {circumflex over (β)}_(o), . . . , {circumflex over (β)}_(Q) are chosen such that the sum of squared errors for item j (SSEj) as shown in Equation 20 is minimized.

SSE_(j)(f)=Σ_(m=0) ^(h≦n)(BV_(j,m)(g)−

_(j,m)(g))²  [Equation 20]

The estimation of the linear coefficients {circumflex over (β)}_(o), . . . , {circumflex over (β)}_(Q), need not be computed at every period. Rather, the coefficients can be updated periodically, for instance, at time t_(p), and then used to forecast the value of

_(j,m)(f)={circumflex over (β)}_(o), . . . , {circumflex over (β)}_(Q), . . . , for any time period t_(m)≧t_(p).

As the final step, once the observed or forecasted values of g_(p,n,q) are determined, the microeconomic factor can be estimated as shown in Equation 21 below.

$\begin{matrix} {{\hat{G}}_{i,{n|{n - h}}} = {{\left( \frac{{\hat{\beta}}_{o} + {\sum\limits_{r = 1}^{R}{{\hat{\beta}}_{r}g_{p,n,q}}}}{{BV}_{i,n}} \right)\mspace{11mu} {where}\mspace{14mu} i} \in {p.}}} & \left\lbrack {{Equation}\mspace{14mu} 21} \right\rbrack \end{matrix}$

With all the pieces assembled, the system can generate a residual value for time t_(n) for items i (519). As an example, this can be accomplished by substituting the values constructed in earlier steps into Equation 22 below.

{circumflex over (V)} _(i,h)=BV_(i,0)+(Σ_(q=1) ^(Q){circumflex over (α)}_(q) f _(.,n,q))({circumflex over (F)} _(n|n-h) −F _(t0))+(Σ_(r=1) ^(QR){circumflex over (β)}_(r) g _(p,n,q))(Ĝ _(p,n|n-h) −G _(t0)))h=1, . . . ,H  [Equation 22]

Equation 22 and its computational components (with their corresponding driving factors) are illustrated in FIG. 11 which depicts a plot diagram illustrating percentage points adjustments by factor, according to some embodiments disclosed herein. In the example of FIG. 11, the first computational component ({circumflex over (V)}_(i,h)) is driven by the baseline value of the item of interest (over a historical average), the second computational component ((Σ_(q=1) ^(Q){circumflex over (α)}_(g)f_(.,n,q))({circumflex over (F)}_(n|n-h)−F_(t0))) is driven by a number of macroeconomic factors, such as gas, ALG economic index, and industry supply), and the third computational component ((Σ_(r=1) ^(QR){circumflex over (β)}_(r)g_(p,n,q))(Ĝ_(p,n|n-h)−G_(t0))) is driven by a number of microeconomic factors such as segment supply (e.g., the supply level of a used vehicle market segment of interest), model supply (e.g., the supply level of a used vehicle model of interest), incentive spending (e.g., incentives offered by the vehicle manufacturer of the vehicle model of interest), rental fleet penetration, redesign, and ALG brand outlook or value. Rental fleet penetration reflects a percentage of new cars entering the rental fleet. For instance, 2,000 of 20,000 new cars sold in a month to a rental company means a 10% rental fleet penetration. Redesign refers to vehicle updates such as a complete new generation (e.g., a complete new model), minor updates (e.g., frontend design changes), or major updates (e.g., interior changes, new powertrain, etc.). Brand outlook or value refers to a measure used by a brand to determine a level of brand trending. Brand outlook can be measured statistically in the used transaction data where the brand rank order in the data can be identified. Consumer surveys can also be used to rank brands.

Optionally, residual value forecasting method 500 may further comprise performing one or more quality assurance (QA) operations on the generated output (521). In some embodiments, the system may compare the forecasted values with a set of reference values. The time point at which the forecasted residual curve is aligned occurs at t*_(n) selected previously (see 505). The approach for adjusting {circumflex over (V)}_(i,n) for QA purposes may include the following steps:

a. Gather residual values from other vehicles in the competitive set (see FIG. 9, which shows a final adjustment based on competitive set comparison). These may include:

-   -   The average residual value at t*_(n) for the entire competitive         set U_(k);     -   The baseline value BV_(j,0) at t₀ for item j in the same         competitive set that is most similar (best match) to item i; and     -   The residual value of item k that is a previous version of item         i (not a modification of item i, but the one that was replaced         in production by item i), if it exists.

b. Compute the adjustment value, C_(iU,n|n*), that will minimizes the weighted average error relative to the position implied by the reference points as shown in Equation 23 below:

C _(iU,n|n*)=Δ[(αV _(U,n)+βBV_(j,0) +rV _(k,n))−V _(i,n)]  [Equation 23]

where α, β, r are assigned weights, α+β+r=1 and α,β,r>0, Δ is a weight depending on whether item i at t_(n) is completely new in the market (Δ=1) or established (Δ<1).

If necessary, the system may adjust the output from Equation 22 with the output from Equation 23 to generate a final forecast of residual value of the item (523). As an example, the system may adjust {circumflex over (V)}_(i,n) by C_(i,U,n|n*) to get the final forecasted value:

{circumflex over (V)} _(i,h)=BV_(i,0)+((Σ_(q=1) ^(Q){circumflex over (α)}_(q) f _(.,n,q))({circumflex over (F)} _(n|n-h) −F _(t0))+(Σ_(r=1) ^(R){circumflex over (β)}_(r) g _(p,n,q))(Ĝ _(p,n|n-h) −G _(t0)))+C _(iU,n|n*) h=1, . . . ,H  [Equation 24]

Referring to FIG. 9, an exemplary residual curve adjustment is shown. In FIG. 9, the dotted line (900) represents the initial computation of the curve. Points A, B, and C represent the average residual values of the competitive set, the current market value of the best matching item in the competitive set, and the previous value of the item of interest, respectively. Taking these data points into account, the final revised residual curve is shown as line 910.

FIG. 6 is a flow diagram illustrating an example of a method for optionally revising a generated residual value curve based on qualitative input via a feedback cycle, according to some embodiments disclosed herein.

In some embodiments, after a baseline residual curve has been generated and stored in a local data storage device, a user of the enterprise computing environment can provide editorial input (605) that is used to revise the residual curve (610). The editorial input may be provided to account for any factors that were not accounted for in the generation of the baseline curve, or that have changed since the baseline curve was generated. The editorial input may also be provided to determine the potential effect of various factors on the residual curve. The editorial input may be provided through a workbench application (see, e.g., FIG. 12) that allows the user to see the results of the input. The residual curve that is revised according to the editorial input can then be made “live” (615). In other words, the revised residual curve can be stored or published to a location to which client access can be enabled. The system allows for periodic revision of the residual curve. If it is time to do so (620) (e.g., if a predetermined interval has been reached), the user can provide additional editorial input (605) for generation of a newly revised residual curve (610), which can then be published for access by the client (615).

In one embodiment, the residual curve is updated at regular intervals. The updated residual curve can be stored in place of the previous baseline curve and used as the baseline for future use. When the residual curve is updated, several comparisons are made to ensure that the newly revised curve is reasonable. For example, the revised curve is compared to the previous curve to determine whether the values of the new curve differ from the previous curve by a substantial amount. If the difference is too great, this may indicate that the inputs to the revised curve are not realistic. The inputs may therefore be adjusted to bring the revised residual curve closer to the previous curve. In one embodiment, the residual curve is also adjusted based on the current values of items in a competitive set. For instance, the curve may be adjusted to bring the curve closer to the value of a closest competitive item, or to the average value of the set of competitive items.

FIG. 7 is a flow diagram illustrating an example of a method for optionally allowing a client to provide qualitative input on a generated residual value curve, according to some embodiments disclosed herein.

In some embodiments, after a baseline residual curve is revised, the server may enable access by a client to the revised curve (705). Customers can access the residual curve through the client to determine the value of the item at some point in the future. The client in this embodiment includes a workbench application that allows the customer to vary some of the factors that affect the residual curve and to view the resulting changes to the residual curve. The server receives input from the client's workbench application (710) and revises the residual curve according to the received input (715). The newly revised residual curve is then provided to the client (720) so that it can be viewed by the customer.

Example Implementation in Automotive Industry

The following describes an exemplary implementation in which the approach described above is adapted to be used to estimate the value of automobiles over time and thereby allows the resale values that could be expected at future time points to be determined. Residual values thus estimated can provide guidelines for pricing fixed-term vehicle leases which captures the expected change in value that will result in the time interval between the leased vehicle's acquisition at time t₀ and its disposition at time t_(d). This example implementation not only can provide the estimated residual value at disposition, V_(i,d), but can also forecast values at equally-spaced fixed points between t₀ and t_(d), thereby allowing construction of a residual curve that captures the relationship between vehicle value and time. Over time and as new information becomes available, this example implementation continues to timely reflect changing values of exogenous macroeconomic and industry-specific microeconomic variables and vehicle-specific, endogenous variables (depreciation, competitive sets, and modifications).

In this example implementation, the guidelines for production of the residual values include:

-   -   The residual values must be realistic and adjusted over the         vehicle's lifecycle to reflect the market, incentives and fleet         purchases.     -   To enable vehicle manufacturers to market their vehicles,         residual values must create clear, consistent messages across         all vehicles being valued.     -   To remain relevant and timely, the residual values must reflect         product enhancements, packaging/content adjustments, etc.     -   To provide utility to each manufacturer's ecosystem, the         residual values must encompass all phases of the automotive         sales cycle, including dealer engagement, manufacturer support,         cooperation on pricing, and off lease supply management.

In this example, the methodology described above is adapted to estimate residual values of cars and light trucks in the United States and Canada. Estimates are updated every two months to reflect new observed data, market conditions, and macroeconomic estimates. As an example of this embodiment, the latest 2017 Model Year (MY) Hyundai Elantra SE with automatic transmission (AT)—which sells at popular equipped MSRP of $19,785 in California—will be used (see FIG. 4 for an image). This particular model has some historical used market value and residual value data available to estimate the future value of the vehicle. Furthermore, exogenous macroeconomic data and microeconomic data as well as endogenous factors (e.g., depreciation rate and competitive knowledge) are readily available to construct the current residual value curve for this vehicle at any term (e.g., 12-month, 24-month, 36-month, . . . , 60-month or any term in between).

Step 1. Determine a baseline, unmodified value for the item i, V_(i,h). The 2014 MY Hyundai Elantra SE AT (item i) baseline value for t₀ is V_(i,h)=$11,075 and is based on an observed current market value (CMV) derived from auction data. Roughly 990 auction records were available in an h estimation period for item i to create the CMV of $11,078 by applying statistical filters and other measures to cleanse the data. For the purpose of illustration and not of limitation, auction records may include such information as: Sale Date; National Automobile Dealers Association (NADA) Vehicle Identification Code; Make; Sub-make; Model Year; Series; Body Style; Diesel 4WD Identifier; NADA Region code; Sale Price; Mileage; Sale Type; Vehicle Identification Number (VIN); Vehicle Identifier (VID); etc.

Step 2. At time t_(n)=0, determine a reference period, t_(n*), at which adjustments will be made to align values with other items in the competitive set based on the following industry-level frequencies that constrain the choice of t_(n*):

-   -   auction data is updated weekly yet also aggregated to monthly         numbers, while microeconomic factors and macroeconomic factors         are updated monthly;     -   forecasted terms go up to 72-month, t_(max) is greater than         72-month;     -   most common terms are 12, 24, 36, 48, and 60-month terms and,         mostly, 36-month is used.

Because a 36-month alignment is commonly used in the automotive industry, a value of t_(n*)=36 months is used in this example for the reference period relative to the baseline.

Step 3. At time t_(n)=0, determine the constant width of time intervals Δ_((p.q)) at which forecasts will be generated. In this case, the selection of Δ_((p.q)) is determined by considering the following constraints:

-   -   It must be chosen such that (t_(n*)−t₀)Δ_((p.q)) is a positive         integer where t_(n*)−t₀=36 months.     -   It must be greater than or equal to φ*=min_(r)(φ_(r))=weekly         since that is the frequency at which the macroeconomic data is         updated.

Given those constraints and a choice of t_(n*)=36, the Δ_((p.q))=2 months is used.

-   -   36-month term/2 month=18>0.     -   Interval is greater than φ*(weekly data).

Step 4. Determine a locality adjustment, L_(p). If the base value of the items in industry p to which item i is assigned varies by geographic region, then compute

$L_{p,n} = \frac{L_{p,n}^{\prime}(z)}{L_{p,n}^{\prime}(Z)}$

where L′_(p,n)(z) is the average cost of items in industry p in locality z at time t_(n), and L′_(p,n)(Z) is the local cost of items in industry p across all localities (zεZ) at time t_(n). In this example, the residual value of the 2017MY Hyundai Elantra SE AT is being established for California, located in the z=“US West” region of the U.S. and where L_(west)=1/1′, local adjustment for U.S. Western region is 100% of the average for all regions in the U.S.

Step 5. Collect or estimate incremental values of modifications, M_(i,n), to the base configuration of the item. In this example, the vehicle has cruise control added as popularly equipped which retains a measurable and tangible value of $375 at 36-month. Thus, M_(Elantra,36-month)=$375 for all regions, M_(Elantra,36-month)+($375*1.0). for U.S. Western region.

Step 6. Determine the locally-adjusted value of the modified item i at time t_(n) by adjusting the base configuration's value to account for modifications and locality adjustments.

BV_(i,n)=(V′ _(i,0) +M _(i,n))×(τ_(i,n) ×L _(p,n))=($11,075+$375)*1.0=$11,450

Step 7. Construct competitive sets, C_(iW,n), of similar and substitute items in the same industry, p. This involves determining what factors to compare to for each competitor and establishing a matrix such as pricing (e.g., MSRP), engine and performance (e.g., horse power, mile per gallon, torque, displacement, etc.), exterior (e.g., curb weight, wheelbase, length, width, height, wheels size, etc.), interior (e.g., dimensions, features, air conditioning, entertainment system, seats, etc.) and safety.

Based on the factors above and the matrix analysis, for example, the 2014 Honda Civic LX AT has the most similarities to the 2014 Hyundai SE AT, followed by 2014 Toyota Corolla L AT.

Step 8. Collect macroeconomic data, F_(-,n|n-h,) and either forecast future levels or incorporate existing forecasts from other sources to determine {circumflex over (F)}_(.,n|n-h).

As an example, suppose the ALG economic index, industry-wide used market supply index, and gas prices are collected at t₀ and forecasted for t_(n) (see, e.g., FIG. 11). Further suppose the ALG economic index is equal to 100 index points, industry-wide used market supply index is equal to 100 index points, and average gas prices are $2.09 per gallon in t₀, whereas the forecasts are 111 points for the ALG economic index, 123 points for industry wide used market supply, and $2.67 for gas price in t_(36-month). The various factors have coefficients which determined based on correlation to auction data and thus the impact on the forecasted values can be applied by using the coefficients. Hence, for example, based on the change in the ALG economic index from currently 100 to 111 in 36-month, the impact on 36-month residual values is an incremental $60, from industry wide used market supply −$450, and from gas prices $165. The total adjustment for macroeconomic variables is −$225 or, mathematically, +a1*(111−100)−a2*(11.5−11.0 million)+a3*($3.00−$3.50)=$60−$450+$165=−$225.

Step 9. Collect microeconomic data, G_(p,n|n-h,) for the industry in which the item being evaluated is classified and forecast future levels or incorporate existing forecasts from other sources to determine Ĝ_(p,n|n-h).

Microeconomic data such as segment-level and model-level used vehicle market supply, brand value, incentive spending, and rental fleet penetration is generated for t₀ and forecasted for t_(36-month). For example, current incentive spending for the Elantra is $2,600, yet the forecast is expected to be at $2,550. Based on the change in incentive spending from today to 36-month, the impact is $15, brand value $100, and used vehicle market supply (at the segment level and the model level)−$160 (see Equations 16-21). The total adjustment for microeconomic variables is −$45, or mathematically, −b1*($1,900 $2,000)−b2*(20−18 index points)−b3*(100−120)=$15+$100−$160=−$45.

Step 10. With all the pieces assembled, forecasting the residual value for time t_(n*)=36 month for the 2017 Hyundai Elantra SE AT (cruise control, in California) can be done by substituting the values constructed in earlier steps into Equation 22:

$\begin{matrix} {{\hat{V}}_{i,h} = {{BV}_{i,0} + \left( {{\left( {\sum\limits_{q = 1}^{Q}{{\hat{\alpha}}_{q}f_{.{,n,q}}}} \right)\left( {{\hat{F}}_{n|{n - h}} - F_{t\; 0}} \right)} + {\left( {\sum\limits_{r = 1}^{R}{{\hat{\beta}}_{r}g_{p,n,q}}} \right)\left( {{\hat{G}}_{p,{n|{n - h}}} - G_{t\; 0}} \right)}} \right)}} \\ {= {{\left( {{{\$ 11},075} + {\$ 375}} \right)*1.0} + \left( {\left( {- {\$ 225}} \right) + \left( {- {\$ 45}} \right)} \right)}} \\ {= {{{\$ 11},450} - {\$ 270}}} \\ {= {{\$ 11},180}} \end{matrix}$

Step 11. Perform quality assurance (QA). In this example, this involves computing the adjustment value, C_(iU,n|n)*(see Equation 23) that will minimizes the weighted average error relative to the position implied by the reference points.

In this example, adjustment value in the case of the Camry LE AT is small since Δ<1, plenty of history is available. The average residual value of entire competitive set is $10,840 and the following factors are taken into account:

-   -   i. Baseline of the closest competitor(s) is $11,465     -   ii. 2014 M Y Hyundai Elantra SE AT is $11,120

Applying Equation 23 described above, the adjustment value C_(iU,n|n*) is then equal to: 0.25×((0.33×$10,840+0.33×$11,120+0.33×$11,465)−$11,180)=−$38

Step 12. Adjust {circumflex over (V)}_(i,h) by C_(iU,n|n*) to determine the final forecasted value:

In this example, applying Equation 24 described above, the final forecast for the 2017 MY Hyundai Elantra SE AT for the Western region for time t_(n*)=36 month is $11,180−$38=$11,142.

Embodiments disclosed herein can provide many advantages. For example, knowledge of the future residual values can be used to:

-   -   a) Set leasing rates of an item which reflect the expected         change in valuation of the item between the beginning and ends         of a fixed lease period—a useful metric that can be used in the         rental industry.     -   b) Determine the amount at which an item can be resold at any         time period—a useful metric that can be used in investment         decisions such as real estate.     -   c) Provide information supporting the strategic planning         decisions made of the manufacturer of item i.     -   d) Determine if the change in value will be constant over time         intervals of the same length.

These, and other, aspects of the disclosure and various features and advantageous details thereof are explained more fully with reference to the exemplary, and therefore non-limiting, embodiments illustrated and detailed in this disclosure. It should be understood, however, that the detailed description and the specific examples, while indicating the preferred embodiments, are given by way of illustration only and not by way of limitation. Descriptions of known programming techniques, computer software, hardware, operating platforms and protocols may be omitted so as not to unnecessarily obscure the disclosure in detail. Various substitutions, modifications, additions and/or rearrangements within the spirit and/or scope of the underlying inventive concept will become apparent to those skilled in the art from this disclosure.

Embodiments discussed herein can be implemented in a computer communicatively connected to a network (for example, the Internet), another computer, or in a standalone computer. As is known to those skilled in the art, a suitable computer can include a central processing unit (“CPU”), at least one read-only memory (“ROM”), at least one random access memory (“RAM”), at least one hard drive (“HD”), and one or more input/output (“I/O”) device(s). The I/O devices can include a keyboard, monitor, printer, electronic pointing device (for example, mouse, trackball, stylist, touch pad, etc.), or the like. In embodiments of the invention, the computer has access to at least one database over a network connection.

ROM, RAM, and HD are computer memories for storing computer-executable instructions executable by the CPU or capable of being complied or interpreted to be executable by the CPU. Suitable computer-executable instructions may reside on a computer readable medium (e.g., ROM, RAM, and/or HD), hardware circuitry or the like, or any combination thereof. Within this disclosure, the term “computer readable medium” or is not limited to ROM, RAM, and HD and can include any type of data storage medium that can be read by a processor. Examples of computer-readable storage media can include, but are not limited to, volatile and non-volatile computer memories and storage devices such as random access memories, read-only memories, hard drives, data cartridges, direct access storage device arrays, magnetic tapes, floppy diskettes, flash memory drives, optical data storage devices, compact-disc read-only memories, and other appropriate computer memories and data storage devices. Thus, a computer-readable medium may refer to a data cartridge, a data backup magnetic tape, a floppy diskette, a flash memory drive, an optical data storage drive, a CD-ROM, ROM, RAM, HD, or the like.

The processes described herein may be implemented in suitable computer-executable instructions that may reside on a computer readable medium (for example, a disk, CD-ROM, a memory, etc.). Alternatively, the computer-executable instructions may be stored as software code components on a direct access storage device array, magnetic tape, floppy diskette, optical storage device, or other appropriate computer-readable medium or storage device.

Any suitable programming language can be used to implement the routines, methods or programs of embodiments of the invention described herein, including C, C++, Java, JavaScript, HTML, or any other programming or scripting code, etc. Other software/hardware/network architectures may be used. For example, the functions of the disclosed embodiments may be implemented on one computer or shared/distributed among two or more computers in or across a network. Communications between computers implementing embodiments can be accomplished using any electronic, optical, radio frequency signals, or other suitable methods and tools of communication in compliance with known network protocols.

Different programming techniques can be employed such as procedural or object oriented. Any particular routine can execute on a single computer processing device or multiple computer processing devices, a single computer processor or multiple computer processors. Data may be stored in a single storage medium or distributed through multiple storage mediums, and may reside in a single database or multiple databases (or other data storage techniques). Although the steps, operations, or computations may be presented in a specific order, this order may be changed in different embodiments. In some embodiments, to the extent multiple steps are shown as sequential in this specification, some combination of such steps in alternative embodiments may be performed at the same time. The sequence of operations described herein can be interrupted, suspended, or otherwise controlled by another process, such as an operating system, kernel, etc. The routines can operate in an operating system environment or as stand-alone routines. Functions, routines, methods, steps and operations described herein can be performed in hardware, software embodied on hardware, firmware or any combination thereof.

Embodiments described herein can be implemented in the form of control logic in hardware or a combination of software and hardware. The control logic may be stored in an information storage medium, such as a computer-readable medium, as a plurality of instructions adapted to direct an information processing device to perform a set of steps disclosed in the various embodiments. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will appreciate other ways and/or methods to implement the invention.

It is also within the spirit and scope of the invention to implement in software programming or code an of the steps, operations, methods, routines or portions thereof described herein, where such software programming or code can be stored in a computer-readable medium and can be operated on by a processor to permit a computer to perform any of the steps, operations, methods, routines or portions thereof described herein. The invention may be implemented by using software programming or code in one or more general purpose digital computers, by using application specific integrated circuits, programmable logic devices, field programmable gate arrays, optical, chemical, biological, quantum or nanoengineered systems, components and mechanisms may be used. In general, the functions of the invention can be achieved by any means as is known in the art. For example, distributed, or networked systems, components and circuits can be used. In another example, communication or transfer (or otherwise moving from one place to another) of data may be wired, wireless, or by any other means.

A “computer-readable medium” may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, system or device. The computer readable medium can be, by way of example only but not by limitation, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, system, device, propagation medium, or computer memory. Such computer-readable medium shall generally be machine readable and include software programming or code that can be human readable (e.g., source code) or machine readable (e.g., object code). Examples of non-transitory computer-readable media can include random access memories, read-only memories, hard drives, data cartridges, magnetic tapes, floppy diskettes, flash memory drives, optical data storage devices, compact-disc read-only memories, and other appropriate computer memories and data storage devices. In an illustrative embodiment, some or all of the software components may reside on a single server computer or on any combination of separate server computers. As one skilled in the art can appreciate, a computer program product implementing an embodiment disclosed herein may comprise one or more non-transitory computer readable media storing computer instructions translatable by one or more processors in a computing environment.

A “computer” or “processor” may include any hardware system, mechanism or component that processes data, signals or other information. A computer or processor can include a system with a general-purpose central processing unit, multiple processing units, dedicated circuitry for achieving functionality, or other systems. Processing need not be limited to a geographic location, or have temporal limitations. For example, a computer or processor can perform its functions in “real-time,” “offline,” in a “batch mode,” etc. Portions of processing can be performed at different times and at different locations, by different (or the same) processing systems.

As used herein, the terms “comprises,” “comprising,” “includes,” “including,” “has,” “having,” or any other variation thereof, are intended to cover a non-exclusive inclusion. For example, a process, product, article, or apparatus that comprises a list of elements is not necessarily limited only those elements but may include other elements not expressly listed or inherent to such process, process, article, or apparatus.

Furthermore, the term “or” as used herein is generally intended to mean “and/or” unless otherwise indicated. For example, a condition A or B is satisfied by any one of the following: A is true (or present) and B is false (or not present), A is false (or not present) and B is true (or present), and both A and B are true (or present). As used herein, including the accompanying appendices, a term preceded by “a” or “an” (and “the” when antecedent basis is “a” or “an”) includes both singular and plural of such term, unless clearly indicated otherwise (i.e., that the reference “a” or “an” clearly indicates only the singular or only the plural). Also, as used in the description herein and in the accompanying appendices, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.

Although the foregoing specification describes specific embodiments, numerous changes in the details of the embodiments disclosed herein and additional embodiments will be apparent to, and may be made by, persons of ordinary skill in the art having reference to this disclosure. In this context, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of this disclosure. The scope of the present disclosure should be determined by the following claims and their legal equivalents. 

What is claimed is:
 1. A method, comprising: collecting used market data, non-industry-specific data, and industry-specific data from disparate data sources into a database, the collecting performed simultaneously, continuously, or periodically by a system communicatively connected to the disparate data sources over a network, the system having a processor and a non-transitory computer-readable medium; determining, by the system using the used market data, a baseline value for an item of interest with a base configuration in an industry at an initial time point, the determining comprising taking an average of historical market values from the used market data; determining, by the system at the initial time point, a reference period at which the baseline value for the item of interest is adjusted; determining, by the system, a number of forecasts desired between the initial time point and the reference period; determining, by the system, a locality adjustment to the item of interest at a forecast time, the locality adjustment representing a ratio of an average cost of items in the industry in a locality at the forecast time over a local cost of items in the industry across all localities at the forecast time; determining, by the system, a locally adjusted value of the item of interest as modified at the forecast time; constructing, by the system, competitive sets of similar items, substitute items, or a combination thereof in the industry to which the item of interest belongs; determining, by the system, to which one and only one of the competitive sets the item of interest belongs; determining, by the system using the non-industry-specific data, a macroeconomic factor by taking a set of macroeconomic variables over a plurality of industries, the set of macroeconomic variables representing macroeconomic features; determining, by the system using the industry-specific data, a microeconomic factor by taking a linear combination of observed or forecasted values of microeconomic variables specific to the industry to which the item of interest belongs; generating, by the system at the forecast time, a residual value for the item of interest, the generating utilizing the baseline value for the item of interest at the initial time point determined by the system using the used market data, the macroeconomic factor determined by the system using the non-industry-specific data, and the microeconomic factor determined by the system using the industry-specific data; storing the residual value for the item of interest in a data storage device; and providing the residual value forecast for the item of interest for presentation on a client device over the network.
 2. The method according to claim 1, wherein the system determines the baseline value for the item of interest responsive to a request from a client device communicatively connected to the system over a network, responsive to an instruction or command from an administrator of the system through a user interface of the system, or responsive to a programmed trigger or scheduled event.
 3. The method according to claim 1, further comprising: constructing competitive sets of similar items in the industry of the item of interest, substitute items in the industry of the item of interest, or a combination thereof; selecting a most similar item from the competitive sets, the substitute items, or the combination thereof as a substitute for the item of interest in the industry; and using a baseline value for the substitute as the baseline value for the item of interest in subsequent steps if the baseline value for the item of interest cannot be determined or obtained from the historical market values.
 4. The method according to claim 1, wherein the system determines the reference period based at least in part on a minimum frequency in which input data from the disparate data sources to the system is updated, an expected total lifetime of the item of interest, or a utility of the residual value forecast generated by the system for the item of interest.
 5. The method according to claim 1, wherein the locality adjustment comprises a first modification type and a second modification type, wherein the first modification type represents any modifications made to the base configuration of the item of interest at a time point in the reference period that are observable and are expected to retain some value in future time periods after the reference period, and wherein the second modification type represents any modifications made to the base configuration of the item of interest at the time point in the reference period that are not observable, not expected to retain value, or both.
 6. The method according to claim 1, wherein the used market data comprises open auction data, closed auction data, and certified pre-owned data.
 7. The method according to claim 1, wherein the macroeconomic features comprise gas prices, an economic index, and industry supply, and wherein the microeconomic variables comprise segment supply, model supply, incentive spending, fleet management, redesign, and brand value.
 8. A system, comprising: a processor; a non-transitory computer-readable medium; and stored instructions translatable by the processor to perform: collecting used market data, non-industry-specific data, and industry-specific data from disparate data sources into a database simultaneously, continuously, or periodically over a network; determining, using the used market data, a baseline value for an item of interest with a base configuration in an industry at an initial time point, the determining comprising taking an average of historical market values from the used market data; determining, at the initial time point, a reference period at which the baseline value for the item of interest is adjusted; determining a number of forecasts desired between the initial time point and the reference period; determining a locality adjustment to the item of interest at a forecast time, the locality adjustment representing a ratio of an average cost of items in the industry in a locality at the forecast time over a local cost of items in the industry across all localities at the forecast time; determining a locally adjusted value of the item of interest as modified at the forecast time; constructing competitive sets of similar items, substitute items, or a combination thereof in the industry to which the item of interest belongs; determining to which one and only one of the competitive sets the item of interest belongs; determining, using the non-industry-specific data, a macroeconomic factor by taking a set of macroeconomic variables over a plurality of industries, the set of macroeconomic variables representing macroeconomic features; determining, using the industry-specific data, a microeconomic factor by taking a linear combination of observed or forecasted values of microeconomic variables specific to the industry to which the item of interest belongs; generating, at the forecast time, a residual value for the item of interest, the generating utilizing the baseline value for the item of interest at the initial time point determined by the system using the used market data, the macroeconomic factor determined by the system using the non-industry-specific data, and the microeconomic factor determined by the system using the industry-specific data; storing the residual value for the item of interest in a data storage device; and providing the residual value forecast for the item of interest for presentation on a client device over the network.
 9. The system of claim 8, wherein the system determines the baseline value for the item of interest responsive to a request from a client device communicatively connected to the system over a network, responsive to an instruction or command from an administrator of the system through a user interface of the system, or responsive to a programmed trigger or scheduled event.
 10. The system of claim 8, wherein the instructions are further translatable by the processor to perform: constructing competitive sets of similar items in the industry of the item of interest, substitute items in the industry of the item of interest, or a combination thereof; selecting a most similar item from the competitive sets, the substitute items, or the combination thereof as a substitute for the item of interest in the industry; and using a baseline value for the substitute as the baseline value for the item of interest in subsequent steps if the baseline value for the item of interest cannot be determined or obtained from the historical market values.
 11. The system of claim 8, wherein the system determines the reference period based at least in part on a minimum frequency in which input data from the disparate data sources to the system is updated, an expected total lifetime of the item of interest, or a utility of the residual value forecast generated by the system for the item of interest.
 12. The system of claim 8, wherein the locality adjustment comprises a first modification type and a second modification type, wherein the first modification type represents any modifications made to the base configuration of the item of interest at a time point in the reference period that are observable and are expected to retain some value in future time periods after the reference period, and wherein the second modification type represents any modifications made to the base configuration of the item of interest at the time point in the reference period that are not observable, not expected to retain value, or both.
 13. The system of claim 8, wherein the used market data comprises open auction data, closed auction data, and certified pre-owned data.
 14. The system of claim 8, wherein the macroeconomic features comprise gas prices, an economic index, and industry supply, and wherein the microeconomic variables comprise segment supply, model supply, incentive spending, fleet management, redesign, and brand value.
 15. A computer program product comprising a non-transitory computer-readable medium storing instructions translatable by a processor of a residual value forecasting system for: collecting used market data, non-industry-specific data, and industry-specific data from disparate data sources into a database simultaneously, continuously, or periodically over a network; determining, using the used market data, a baseline value for an item of interest with a base configuration in an industry at an initial time point, the determining comprising taking an average of historical market values from the used market data; determining, at the initial time point, a reference period at which the baseline value for the item of interest is adjusted; determining a number of forecasts desired between the initial time point and the reference period; determining a locality adjustment to the item of interest at a forecast time, the locality adjustment representing a ratio of an average cost of items in the industry in a locality at the forecast time over a local cost of items in the industry across all localities at the forecast time; determining a locally adjusted value of the item of interest as modified at the forecast time; constructing competitive sets of similar items, substitute items, or a combination thereof in the industry to which the item of interest belongs; determining to which one and only one of the competitive sets the item of interest belongs; determining, using the non-industry-specific data, a macroeconomic factor by taking a set of macroeconomic variables over a plurality of industries, the set of macroeconomic variables representing macroeconomic features; determining, using the industry-specific data, a microeconomic factor by taking a linear combination of observed or forecasted values of microeconomic variables specific to the industry to which the item of interest belongs; generating, at the forecast time, a residual value for the item of interest, the generating utilizing the baseline value for the item of interest at the initial time point determined using the used market data, the macroeconomic factor determined using the non-industry-specific data, and the microeconomic factor determined using the industry-specific data; storing the residual value for the item of interest in a data storage device; and providing the residual value forecast for the item of interest for presentation on a client device over the network.
 16. The computer program product of claim 15, wherein the baseline value for the item of interest is determined responsive to a request from a client device communicatively connected to the system over a network, responsive to an instruction or command from an administrator of the residual value forecasting system through a user interface of the residual value forecasting system, or responsive to a programmed trigger or scheduled event.
 17. The computer program product of claim 15, wherein the instructions are further translatable by the processor for: constructing competitive sets of similar items in the industry of the item of interest, substitute items in the industry of the item of interest, or a combination thereof; selecting a most similar item from the competitive sets, the substitute items, or the combination thereof as a substitute for the item of interest in the industry; and using a baseline value for the substitute as the baseline value for the item of interest in subsequent steps if the baseline value for the item of interest cannot be determined or obtained from the historical market values.
 18. The computer program product of claim 15, wherein the reference period is determined based at least in part on a minimum frequency in which input data from the disparate data sources is updated, an expected total lifetime of the item of interest, or a utility of the residual value forecast generated for the item of interest.
 19. The computer program product of claim 15, wherein the locality adjustment comprises a first modification type and a second modification type, wherein the first modification type represents any modifications made to the base configuration of the item of interest at a time point in the reference period that are observable and are expected to retain some value in future time periods after the reference period, and wherein the second modification type represents any modifications made to the base configuration of the item of interest at the time point in the reference period that are not observable, not expected to retain value, or both.
 20. The computer program product of claim 15, wherein the used market data comprises open auction data, closed auction data, and certified pre-owned data. 